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Abstract 

We study threshold properties of random constraint satisfaction problems under 
a probabilistic model due to Molloy fTTlL We give a sufficient condition for the 
existence of a sharp threshold that leads (for boolean constraints) to a necessary 
and sufficient for the existence of a sharp threshold in the case where constraint 
templates are applied with equal probability, solving thus an open problem from 
0 

1 Introduction 

Classifying threshold properties of random constraint satisfaction problems is a 
problem that has been intensely studied in recent literature. The well-known Friedgut- 
Bourgain result 0 showed that 3-satisfiability has a sharp threshold, via a more 
general result on threshold properties of monotonic sets. 

For random satisfiability problems (i.e. constraint satisfaction problems with a 
boolean domain) we have obtained 0 a classification of thresholds of such proper¬ 
ties, under a random model that allows constraints of different lengths. The results 
from 0 intuitively show that satisfiability problems with a coarse threshold qual¬ 
itatively “behave like random Florn satisfiability”, a problem with a known coarse 
threshold 0. A drawback of these results is that the classification is not “struc¬ 
tural” (that is defined in terms of properties of the formula hypergraph), and thus 
does not offer suggestions for a suitable generalization to non-boolean domains. 
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Recently Molloy ill! has investigated a model of random constraint satisfac¬ 
tion that allows constraints of the same arity and unequal probabilities for the appli¬ 
cation of the given constraint templates (hence is only partly comparable with the 
results in (7J). He offers a nice structural condition that is necessary and sufficient 
for the existence of a weaker form of threshold property (he calls a transition). 

While Molloy’s model certainly has several remarkable features (such as the 
location of the transition in the region of formulas whose ratio between clauses 
and variables is constant), it was observed in GU that the necessary and sufficient 
condition for the existence of a transition is not sufficient for the existence of a 
sharp threshold. The counterexample involves binary constraints on {0,1,2,3} 
and a nonuniform probability distribution on the set of such templates. For the case 
of a uniform probability distribution he gave (Theorem 6) a sufficient condition for 
the existence of a sharp threshold. 

In this note we strengthen this latter result: We give a more general sufficient 
condition for the existence of a sharp thresholds, that allows to obtain a precise clas¬ 
sification of sharp/coarse thresholds for random satisfiability problems (problems 
with a binary domain), confirming thus a conjecture due to Creignou and Daude. 
The key to these results is a “monotonicity” property of the Friedgut-Bourgain 
method for proving the existence of a shaip threshold. We have first observed this 
property in 0, and it is, we believe, of independent interest. 

2 Preliminaries 

Throughout the paper we will assume familiarity with the general concepts of phase 
transitions in combinatorial problems (see e.g. GDI), random structures 0. Some 
papers whose concepts and methods we use in detail (and we assume greater fa¬ 
miliarity with) include Effl 

Consider a monotonically increasing problem A = (A n ). The two models used 
in the theory of random structures are: 

• The constant probability model F(n. p). A random sample from this model 
is obtained by independently setting to 1 with probability p each bit of the 
random string. 

• The counting model T(n, M ). A random sample from this model is obtained 
by setting to 1 M different bits chosen uniformly at random from the n bits 
of the random sample. 

As usual, for e > 0 let p e = p e (n) define the canonical probability such that 
P r xer(n,p £ (n)) [ x F A] = f . The probability that a random sample x satisfies prop- 


erty A (i.e. x £ A) is a monotonically increasing function of p. Sharp thresholds 
are those for which this function has a “sudden jump” from value 0 to 1: 


Definition 1 Problem A lias a sharp threshold iff for every 0 < e < 1/2, we have 
linin^oo = 0. A has a coarse threshold if for some e > 0 it holds that 


lim 


Pl-e(n)-p t (n) 


-ra-t-oo p 1 / 2 (n) 


> 0 . 


For satisfiability problems (whose complements are monotonically increasing) 
the constant probability model amounts to adding every constraint (among those al¬ 
lowed by the syntactic specification of the model) to the random formula indepen¬ 
dently with probability p. Related definitions can be given for the other two models 
for generating random structures, the counting model and the multiset model [0. 
Under reasonable conditions 0 these models are equivalent, and we will liberally 
switch between them. In particular, for satisfiability problem A, and an instance 
<I> of A, C/i (F) will denote its constraint density, the ratio between the number of 
clauses and the number of variables of F. To specify the random model in this 
latter cases we have to specify the constraint density as a function of n, the number 
of variables. We will use ca to denote the value of the constraint density ca(F) 
(in the counting/multiset models) corresponding to taking p = p \/2 in the constant 
probability model, ca is a function on n that is believed to tend to a constant limit 
as n —> oo. However, Friedgut’s proof 01 of a sharp threshold in />:-SAT (and our 
results) leave this issue open. 


Definition 2 Let V = {0,1,... , t — 1}, t > 2 be a fixed set. Consider the set of 

xk 

all 2 — 1 potential nonempty binary constraints on k variables X \,..., X^. We 

fix a set of constraints C and define the random model OS P(C). A random formula 
from CSPn iP (C) is specified by the following procedure: 

• n is the number of variables. 

• for each k-tuple of ordered distinct variables (x \,..., Xk) and each C E C 
add constraint C(x \,..., x /,.) independently with probability p. 

When the number of variables n is known (or unimportant) we will drop it from 
our notation, and write CSP p (C) instead. 

Remark 1 The model in Definition\2\differs from the model in sm in two respects: 

• It is a “constant probability ’’-type model (the one in SIB is a “counting”- 
type model). 




• More importantly, all templates C £ C are applied with the same proba¬ 
bility (in a probability V on the set of all templates is considered, and 
templates are instantiated according to this distribution). 

However, as discussed in UB (Remark 2) one can map Molloy’s model onto a 
modified version of the constant probability model. In particular Definition\2\cor- 
responds to this mapping when V is the uniform distribution, which is why we 
chose this definition as the starting point. 

Definition 3 If & is a constraint satisfaction problem, the formula hypergraph of 
<f> is the hypergraph H that has 

• the set of variables that appear in $ as vertices. 

• the sets of variables that appear together in a constraint of <f > as edges. 

H is tree-like if it is a connected acyclic hypergraph, and is unicyclic if there 
exists an edge e £ H such that H \ e is tree-like. 

3 Coarse and sharp thresholds of random generalized con¬ 
straint satisfaction problems 

In this section we study the shaipness of the threshold for random generalized 
constraint satisfaction problem defined by Molloy CE2- He defined a weaker type 
of threshold (he calls transition), and provided a necessary and sufficient condition 
for the existence of a transition, called very well-behavedness of the constraint set. 

Definition 4 A value 5 is 0-bad if there exists some canonical variable x, and 
constraint C £ C such that C \= (x t f 5). We say that 6 is j- bad if there exists 
some constraint C £ C and variable Xi such that C A (x t = 5) implies that some 
other variable xr of C must be assigned a j'-bad value, for some j' < j. 5 is bad 
if it is j-bad for some j and good otherwise. 

Definition 5 A set of constraints C is well-behaved iff: 

1. there exists at least one good value in V. 

2. for every S £ V there exists at least one constraint C £ C not satisfied by 
the assignment (5,5,... , 5). 

C is very well-behaved if, in addition to the previous two properties, satisfies 
the following property: any constraint formula from CSP(C ) whose constraint 
hypergraph is a cycle has a satisfying assignment where no variable is assigned a 
bad value. 


Molloy has proved that the condition that C is very well behaved is necessary 
for CSP{C) to have a sharp threshold. Unfortunately this condition is not also 
sufficient for the existence of a sharp threshold: there exist du pathological 
examples of very-well behaved binary constraint satisfaction problems with a 4- 
ary domain that have a transition but do not have a sharp threshold: 

Example 1 Let C consist of two binary constraints over domain {0,1, 2, 3}. The 
first constraint C\(x,y), forbids the pair (x,y) from taking values from the set 
(0, 0), (1,1), (2, 2), (3, 3). The second one , C 2 (x, y) forbids the situation when one 
of the variables takes a value from the set {0,1} while the other constraint takes 
a value from the set {2,3}. Molloy’s model allows for constraint templates to be 
applied with nonunifonn probabilities, in this case P(C\) = 1/3. P(Cf) = 2/3. 

Molloy’s counterexample involves non-uniform probabilities, so it would be 
tempting to conjecture 1 that at least in the case of uniform probabilities very well- 
behavedness is necessary and sufficient for the existence of a sharp threshold of 
CSP{C). Unfortunately not even this is true 2 : 

Example 2 Let K. consist of three constraints of arity 4 over domain {0,1, 2,3} 
defined as follows (with respect to the constraints in Example 0 -' 

K\{x,y,z) = Ci(x,y), 

K 2 {x,y,z ) = C 2 (x,y), 

and 

K 3 (x,y,z) = C 2 (z,y), 

The constraints are applied with uniform probability. It is easy to see that CSP(K) 
is equivalent to the constraint satisfaction problem from Molloy’s counterexam¬ 
ple 0 

Nevertheless, in what follows we will obtain a necessary condition for the exis¬ 
tence of a shaip threshold that completely solves the problem in the case of binary 
constraints applied with uniform probability. 

Definition 6 A set of constraints C is extremely well-behaved iff: 

1. C is very-well behaved. 

*we had an incorrect proof of this claim in an earlier version of the paper 

2 we thank an anonymous referee for this counterexample. 



2. There exists a mapping T from good values to constraints in C such that, for 
every good value 5, constraint I',) := T(<5) G C satisfies 


r« 5 (xi,... ,x k ) 1= (zi = S) V ... V {x k = 5). (1) 

Theorem 1 Let C be a set of extremely well-behaved constraints. Then CSP{C ) 
has a sharp threshold. 

The result of the theorem is incomparable with Molloy’s sufficient condition 
for the existence of a shaip threshold: our result does not imply the fact that 3- 
coloring has a sharp threshold |T), while his does. On the other hand his result is 
not strong enough to yield the Corollary below. 

We now apply the previous result to the case of boolean constraint satisfaction 
(satisfiability) problems: 

Definition 7 Constraint C 2 is an implicate of C\ iff every satisfying assignment 
for C\ satisfies G'a- 

Definition 8 A boolean constraint C strongly depends on a literal if it has an unit 
clause as an implicate. 

Definition 9 A boolean constraint C strongly depends on a 2-XOR relation if 
3i,j G {1,..., k} such that constraint “xi Xj” is an implicate of C. 

In this case we obtain the following explicit result, that totally settles the case of 
random satisfiability problems, thus solving the open problem from [3) and extend¬ 
ing the results from 0, where the result was shown to hold under the additional 
restriction that all constraint templates in set V are symmetric. 

Corollary 1 Consider a generalized satisfiability problem SAT(C) (that is not 
trivially satisfiable by the “all zeros” or “all ones” assignment). 

1. if some constraint in C strongly depends on one component then SAT(C) 
has a coarse threshold. 

2. if some constraint in C strongly depends on a 2XOR-relation then SAT(C) 
has a coarse threshold. 

3. in all other cases SAT(C ) has a sharp threshold. 


Proof. The first two cases were proved by Creignou and Daude in f3l. In the third 
case first, it is easy to see that both values 0 and 1 are good, since there is no 0-bad 
value (hence no bad value, otherwise C would strongly depend on one variable). 

Since constraints in C are not 0/1-valid and their domain is boolean it follows 
that condition 0 in the definition of extreme well-behavedness is satisfied. So all 
it is left to show is that C is very well behaved. 

Indeed, consider a formula <F whose associated hypergraph is a cycle. If $ were 
unsatisfiable then it would be minimally unsatisfiable (since all acyclic formulas 
are satisfiable). But Theorem 4.3 in j3| prevents that from happening, since for all 
minimally unsatisfiable formulas | Var(5)| < (k — 1)151 — 1, whereas for a cycle 
\Var(S)(k — 1)|<S|. 

□ 

4 Proof of Theorem fl] 

Our proof relies on the Friedgut-Bourgain criterion for the existence of a sharp 
threshold in any monotonic graph (or hypergraph) property A. 

It is a well-known (and easy to see) fact that if property A has a coarse threshold 
then there exists 0 < e < 1/2, p* = p*(n) E [pi_ e ,p e ] and C > 0 such that 

dp p (A) 

P ' dp Ip=p*(n) < C. I 2 ) 

Bourgain and Friedgut show 3 that 

Proposition 1 Suppose p = o(l) is such that condition [2 holds Then there is 5 = 
5(C) > 0 such that either 

p p (x E {0, l} n | x contains x' E A of size \x'\ < 10C} > 5 (3) 

or there exists x' 0 A of size \x'\ < 10C such that the conditional probability 

p p (x E A\x D x) > p p (A ) + 5. (4) 

Observation 1 We will, in fact, need one property that is not directly guaranteed 
by the Bourgain-Friedgut result as stated in but follows from an observation 

made in Hi For a finite set of words W define the filter generated by W, F(W) 
as 

F(W) = {x | (3y E W) with x D y }. 

3 in 0 the proposition is stated assuming for convenience that p = pi/ 2 , but this is not needed. 
We give here the general statement. 




Friedgut observed ( 0, remarks on pages 5-6 of that paper) that the statement 
of condition 0 can be strengthened in the sense that the set W of “booster ” sets x' 
satisfies p p (F(W )) = 17(1). Returning now to the case of random constraint sat¬ 
isfaction problems, since the number of isomorphism types of formulas of bounded 
size is finite, there exists 5 > 0 and a satisfiable booster formula F such that 

1. condition 0 holds with x' = F. 

2. Formula F appears with probability 17(1) as a subformula in a random for¬ 
mula in CSP p (C). 

A standard observation is that in the second condition of Proposition Q instead 
on conditioning on the presence of x' as a subset of x one can, instead, add it: 

Proposition 2 Suppose p = o(l) is such that condition 0 holds. Then there is 
5 = 6(C) > 0 such that either 

p p (x € {0, l} n | x contains x' G A of size \x'\ < 10 C} > 6 (5) 

or there exists x' f A of size \x'\ < 10C such that 

p p (x U x' € A) > p, p (A) + 6. (6) 

Finally, note that for random constraint satisfaction problems, because of the 
invariance of such problems under variable renaming, one only needs to add a 
random copy of x' . That is, the following version of Proposition |2]holds: 

Proposition 3 Suppose A = CSP(C) and p = off) is such that condition 0 holds. 
Then there is 6 = 6(C) > 0 such that either 

p p (x € {0, l} n | x contains x' G A of size \x'\ < 10C} > <5 (7) 

or there exists x' fL A of size \x'\ < 10C such that, if H denotes the formula 
obtained by creating a copy of x' on a random tuple of variables, then 

p, p fx U E G A) > p p (A) + 6. (8) 

To show that random CSP(C) has a shaip threshold, we will reason by contra¬ 
diction. Assuming this is not the case, one needs to prove that the two conditions 
in Proposition |3]do not hold. 

Suppose, indeed, that the first condition was true: that is, with positive prob¬ 
ability it is true that a random formula $ e CSP(C) contains some unsatisfiable 



subformula <f>' of size at most IOC. One can, therefore, assume that with positive 
probability <I> contains a minimally unsatisfiable formula <f>' of size at most IOC. 

On the other hand, with high probability all subformulas of a random formula 
<f> of size at most IOC are either tree-like or unicyclic. But because of well- 
behavedness, all formulas in CSP(C) that are tree-like or unicyclic are satisfiable. 
Therefore the first condition in Proposition[3]cannot be true. 

Assume, now, that the second condition is true: there exists a satisfiable for¬ 
mula F of size at most IOC, such that conditioning on a random formula in 
CSPp(C) containing a copy of F boosts the probability of unsatisfiability by a 
value bounded away from zero. Because of the two conditions in Observation |T| 
we infer that there exists a constant S > 0 such that adding F to a random formula 
<f> E CSP p (C) boosts the probability of ((((satisfiability of the resulting formula by 
at least 5. As we discussed, we assume that F occurs with probability 0(1) in a 
random formula in CSP p (C). Therefore F is tree-like or unicyclic (this argument 
was also used in CD). 

Definition 10 A unit constraint is a constraint (not necessarily part of the con¬ 
straint set C) specified by a condition X = 5, with X being a variable and 5 E D. 

Lemma 4.1 Every tree-like or unicyclic formula has a satisfying assignment W 
consisting only of good values. 

Proof. 

This is easily proved by induction on the number of clauses for tree-like for¬ 
mulas, even in a stronger form: if we set one of the variables to an arbitrary good 
value, we can still set the other variables to good values in such a way that we 
obtain a satisfying assignment. 

For a unicyclic formula we first set the variables appearing in its unique cycle 
to good values so that all such constraints are satisfied (this is possible since C is 
very well-behaved). We are now left with several tendrils, tree-like formulas on 
disjoint set of variables, the root of each such formula (the node appearing in the 
cycle) being set to a fixed good value, which we can satisfy as in the first case. □ 


Claim 1 If E satisfies condition © then there exists another formula G that is 
specified by a finite conjunction of unit constraints 

G=(X 1 = S 1 ) A ... A (X p = S p ), 

with all the values 6\.... ,d p E T> being good values, and that also satisfies 
condition ©• 


Proof. Formula S appears with constant probability in a random CSP(C ) for¬ 
mula with probability p and has constant size. Therefore E is either tree-like or 
unicyclic. The result follows easily from Lemma ETTl by replacing F with formula 
G consisting of the conjunction of unit constraints corresponding to a satisfying 
assignment of E with good values. Indeed, G is tighter than E, so adding a random 
copy of G instead of a random copy of E can only increase the probability that the 
resulting formula is unsatisfiable. □ 


The key to refuting condition © is to show that, if it did hold then, for every 
monotonically increasing function f(n) that tends to infinity, we can also increase 
the probability of unsatisfiability by a positive constant if, instead of condition¬ 
ing on x containing a copy of formula F we add f(n) random constraints from 
constraint set C. We first prove: 

Claim 2 Let 0 < r < 1 be a constant and let p be such that p p (CSP(C)) > 
t. Assume that r > 1 and that g \. <)■> ,... g r are good values such that, when 
(X\, X' 2 ,..., X r ) is a random r-tuple of different variables 


Pr (<b has a satisfying assignment with X\ = gi,, X r 



(9) 


Then there exists constant m > 1 (that only depends on k, r, t) such that, if 
rj denotes a formula from CSP(C ) obtained by adding, for each good value x, 
m. ■ r ■ 2 kr random copies ofT(x), then 


Pr( < h U t] is satisfiable ) < 


r 

2 


( 10 ) 


Proof. We will give a proof of Claim |2] that is very similar to that of the corre¬ 
sponding proof in |jT), thus obtaining the desired contradiction. 

For i <G {1,..., r} define A, to be the event that the formula has no satisfying 
assignment with the first i constraints X\ = g\,..., X r = g, holding. Also define 
Aq to be the event that <f> is not satisfiable. 

The hypothesis translates as the fact that both inequalities Pr(Af) > t and 
Pr(A r ) < | are true. Therefore 


= LXAM < T -11 = i. 

' Pr(A 0 ) ~ r 2 

Thus we have 

a r := Pr[A r \A 0 ] = Pr[A r -i\A 0 } + Pr[A r \A r -i AA 0 ] ■Pr[A r _ l \A 0 \ > ^ (11) 





Pr[A r \A r ^\ A Aq] = Pr{A r \A r _ \ is the fraction of variables that have to 
receive values different from g r if constraints X\ = gi ,..., X r _ \ = g r _i are 
added to <I>; let C r be the set of such variables. If instead of the last constraint 
we add a random copy of the constraint V(g r ) we spoil satisfiability as well when 
all the variables appearing in the constraint arc in the set C r . Denoting A r = 
(Pr[A r \A r _i]) k , this last event happens with probability A r /(1 — o(l)), so the 
probability that the resulting random formula is unsatisfiable is at least 

Pr ■= iMA-rl^o] + 1 _ A " • Pr\A^\M (12) 

Because of the convexity of the function f(x) = x k and constraint |T7] by 
applying Jensen’s inequality it follows that 

^ < a L ; = (Pr[A r _Mo] ■ 1 + Pr[A r \^] ■ Pr\A^\A- 0 ]) k < 

< Pr[A r ^\A Q ] • l k + Pr[A r \A^[] k • Pr[A~,\M = 

= (Pr[A r -\\A<f\ + X r ■ Pr[A r _i\Af\) = Pr ■ (1 + o(l)). 

Thus p r > ■ (1 — o(l)). The conclusion of this argument is that adding 

one random copy of T( 6 r ) instead of the r-th constraint lowers the probability of 
unsatisfiability to no less than • (1 — o(l)). Adding the copy of the constraint 
before the first r — 1 constraints and repeating the argument recursively implies 
the fact that, if instead of adding the r constraints to <I> we add r random copies of 
T( 6 i),.... T ( b r ) the probability of unsatisfiability of the resulting formula, given 
that ( I> was satisfiable, is at least 7 r = . Since the values b\,, b r can 

repeat themselves, the same is true if we add r random copies of T(x) for every 
good value x. 

Suppose now that we add r ■ m ■ 2 kr copies of each T(x) instead (that is, we 
repeat the random experiment m ■ 2 k ' times, for some integer m > 1). By doing 
so the probability that none of the experiments will make the resulting formula 

fc r 

unsatisfiable is at most (1 — 7 r ) m '“ . For some constant m this is going to be at 
most 1 — 7 j. This means that 

Pr(<h U 7 is satisfiable) < — (13) 

□ 


Condition E3 can be refuted directly, leading to a contradiction. This is done 
e.g. by Lemma 3.1 in Q]|. For convenience, we now restate this result: 













Lemma 4.2 For a monotone property 4 A let 

p(p) = Pr[G G r(n,p) has property A], 

p + (p, M ) = Pr[G\ U G 2 | Gi G T(n,p), G 2 G T(n, M) /ms property A}. 

Let A = 2l(n) C {0, l} n be a monotone property and M = M(n ) such that 
M = o{y/rvp). Then: 

\p(p) - p + (p,M)\ = o(l). 

Now it is easy to obtain a contradiction: consider a random formula // with 
/(n) clauses, for some /(n) —► oo. It is easy to show that the probability that r/ 
contains, for some good value x, less than M copies of constraint T (x) is o(l). So 
adding rj instead of the random formula in Claim |2] only increases the probability 
of the resulting formula being unsatisfiable. But then the conclusion of Lemma l4~2l 
directly contradicts that of Claim[2| 

5 Conclusions 

What made the proof work ? Its main steps arc, of course, quite similar to the 
proofs of similar results in El □ EHU] One element we want to highlight is a 
certain monotonicity property used in the proof of Claim[2] since it already proved 
useful in obtaining further insights on classifying threshold properties of random 
constraint satisfaction problems □a, and can reasonably be expected to help in 
obtaining a complete classification: In Claim |2] we have only used the fact that for 
every good value hi there exists a constraint Cs 1 [x\ ...., X^) in the image of map 
T that implies the constraint (X] = hi) V ... V (X/^ = hi). This means that as long 
as the other steps of the proof continued to work we could have proved the Claim 
if the constraint Cs 1 was exactly the constraint (X\ = hi) V ... V (Xf. = hi). This 
provides a general strategy for proving sharp threshold results: 

1. identify a “base case” B for which the analog of Claim[2]can be proved. 

2. If we are given a set of constraints C that are tighter than the ones of the base 
case B, to prove that CSP{C) has a shaip threshold it is enough to verify 
that all the other steps of the proof still hold. The analog of Claim[2]will now 
follow from the corresponding claim for the base case B. 

4 Achlioptas and Friedgut assume A to be a monotone graph property, but this fact is not used 
anywhere in their proof. 



To sum up, we have proved a sufficient condition for the existence of a shaip 
threshold for random constraint satisfaction problems that completely solves the 
boolean case. Example |2] showed, however, that the results could not be extended 
to all very-well behaved sets of constraints. Obtaining a necessary and sufficient 
condition for the existence of a sharp threshold in random constraint satisfaction 
problems is an interesting open problem. 
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